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AMENDMENTS 

Please incorporate the following amendments into the subject application. 
In the Claims: 

1 . (Currently Amended) A method of identifying a sequence of a nucleic 
acid that is suitable for use as a substrate surface immobilized normalization probe, said 
method comprising: 

(a) identifying a plurality of candidate probe sequences for a target 
nucleic acid based on at least one selection criterion; 

(b) empirically evaluating each of said candidate probe sequences 
under a plurality of different experimental conditions to obtain a collection of 
empirical data values for each of said candidate nucleic acid probe sequences 
for each of said plurality of different experimental conditions, and for each 
member of said plurality of different experimental conditions: 

(i) providing an array of nucleic acid probes immobilized on a 
surface of a solid support, wherein said array includes a substrate surface 
immobilized nucleic acid candidate probe for each of said identified candidate 
probe sequences; 

(ii) subjecting said array to said member of said plurality of 
different experimental conditions; and 

(iii) producing empirical gene expression data; 

(c) clustering said candidate probe sequences into one or more groups 
of candidate probe sequences based on each candidate probe sequence's 
collection of empirical gene expression data values, wherein each of said one or 
more groups exhibits substantially the same performance across said plurality of 
experimental conditions; and 

(d) identifying any sequences of nucleic acids that are suitable for use 
as substrate surface immobilized normalization probes from said plurality of 
candidate probe sequences, comprising evaluating any remaining candidate 
probe sequences not among said one or more groups of candidate probe 
sequences for candidate probe sequences that satisfy a signal intensity threshold 
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and exhibit substantially no variation in signal under said plurality of different 
experimental conditions ; and 

(e) outputtinq said sequences of nucleic acids that are suitable 
for use as substrate surface immobilized normalization probes to a user. 

2. (Original) The method according to Claim 1 , wherein said at least one 
selection criterion employed in said identifying step (a) is chosen from: 

(i) proximity to the 3' end of said target nucleic acid's corresponding 
mRNA transcript; 

(ii) base composition; and 

(iii) lack of homology to other expressed sequences of said target 
nucleic acid's organism. 

3. (Previously Presented) The method according to Claim 2, wherein all 
three of said selection criteria (i), (ii) and (iii) are employed in said identifying step (a). 

4. (Original) The method according to Claim 3, wherein said identifying step 
(a) is further characterized by employing parameters that minimize the number of 
identified candidate probe sequences that overlap with each other. 

5. (Cancelled) 

6. (Previously Presented) The method according to Claim 1 , wherein each 
member of said plurality of different experimental conditions is a different differential 
gene expression assay performed on a tissue or cell line. 

7. (Original) The method according to Claim 1 , said clustering step (c) 
comprises: 

(i) obtaining an expression vector for each of said candidate probe 
sequences using said candidate sequence's collection of empirical data values; 
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(ii) deriving a similarity matrix for the set of said candidate probe 
sequences from said candidate probe sequences' expression vectors; and 

(iii) grouping said candidate probe sequences based on their derived 
similarity. 

8. (Original) The method according to Claim 7, wherein those candidate 
probes that have substantially similar expression patterns are grouped together. 

9. (Original) The method according to Claim 1 , wherein the clustering step 
employs an affinity threshold or another stringency controlling parameter. 

1 0. (Previously Presented) The method according to Claim 1 , wherein a 
candidate probe sequence exhibits substantially no variation in signal under said 
plurality of different experimental conditions if its log ratio is not significantly different 
than zero across said plurality of different experimental conditions. 

1 1 . (Original) The method according to Claim 1 0, wherein said log ratio is 
between about 0.5 and -0.5. 

1 2. (Previously Presented) The method according to Claim 1 , wherein said 
plurality of different experimental conditions is at least 2. 

1 3. (Previously Presented) The method according to Claim 1 2, wherein if no 
non-clustering probes are present after said clustering step (c), said identifying step (d) 
is not performed. 

1 4. (Original) The method according to Claim 1 , wherein at least some of 
said steps are carried out by a computational analysis system. 
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1 5. (Original) A computer-readable medium having recorded thereon a 
program that identifies a sequence of a nucleic acid that is suitable for use as a 
substrate surface immobilized normalization probe according to the method of Claim 1 . 

1 6. (Original) A computational analysis system comprising a computer- 
readable medium according to Claim 15. 

17-25. (Cancelled) 

26. (Previously Presented) A method of identifying a sequence of a nucleic 
acid that is suitable for use as a substrate surface immobilized normalization probe, said 
method comprising: 

(a) identifying a plurality of candidate probe sequences for a target 
nucleic acid based on at least one selection criterion; 

(b) empirically evaluating each of said candidate probe sequences 
under a plurality of different experimental conditions to obtain a collection of 
empirical gene expression data values for each of said candidate nucleic acid 
probe sequences for each of said plurality of different experimental conditions; 

(c) clustering said candidate probe sequences into one or more groups 
of candidate probe sequences based on each candidate probe sequence's 
collection of empirical gene expression data values, wherein each of said one or 
more groups exhibits substantially the same performance across said plurality of 
experimental conditions; and 

(d) identifying any sequences of nucleic acids that are suitable for use 
as substrate surface immobilized normalization probes from said plurality of 
candidate probe sequences, comprising evaluating any remaining candidate 
probe sequences not among said one or more groups of candidate probe 
sequences for candidate probe sequences that satisfy a signal intensity threshold 
and exhibit substantially no variation in signal under said plurality of different 
experimental conditions; and 
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(e) recording the identified candidate probe sequences on a computer- 
readable medium; and 

(f) reporting said identified candidate probe sequences to a user. 

27. (Previously Presented) The method according to Claim 26, wherein said 
at least one selection criterion employed in said identifying step (a) is chosen from: 

(i) proximity to the 3' end of said target nucleic acid's corresponding 
mRNA transcript; 

(ii) base composition; and 

(iii) lack of homology to other expressed sequences of said target 
nucleic acid's organism. 

28. (Previously Presented) The method according to Claim 26, said 
clustering step (c) comprises: 

(i) obtaining an expression vector for each of said candidate probe 
sequences using said candidate sequence's collection of empirical data values; 

(ii) deriving a similarity matrix for the set of said candidate probe 
sequences from said candidate probe sequences' expression vectors; and 

(iii) grouping said candidate probe sequences based on their derived 
similarity. 

29. (Previously Presented) The method according to Claim 28, wherein 
those candidate probes that have substantially similar expression patterns are grouped 
together. 

30. (Previously Presented) The method according to Claim 26, wherein at 
least some of said steps are carried out by a computational analysis system. 
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